Robust Reinforcement Learning with Relevance Vector Machines
نویسندگان
چکیده
Function approximation methods, such as neural networks, radial basis functions, and support vector machines, have been used in reinforcement learning to deal with large state spaces. However, they can become unstable with changes in the samples state distributions and require many samples for good estimations of value functions. Recently, Bayesian approaches to reinforcement learning have shown advantages in the explorationexploitation tradeoff and in lower sampling costs. This paper proposes a novel reinforcement learning framework that uses the relevance vector machines (RVM) as a function approximator, which incrementally accumulates knowledge from experiences based on the sparseness of the RVM model. This gradual knowledge construction process increases the stability and robustness of reinforcement learning by preventing possible forgetting. In addition, RVM’s low sampling costs improve the learning speed. The approach is examined in the popular benchmark problems of pole-balancing and mountain car.
منابع مشابه
A Bayesian Reinforcement Learning framework Using Relevant Vector Machines
In this work we present an advanced Bayesian formulation to the task of control learning that employs the Relevance Vector Machines (RVM) generative model for describing value functions. The key aspect of the proposed method is the design of the discount return as a generalized linear model that constitutes a well-known probabilistic approach. This allows to augment the model with advantageous ...
متن کاملAdaptive and Efficient Image Retrieval with One-Class Support Vector Machines for Inter-Query Learning
We present an extension of previous work on improving the initial image retrieval set by exploiting both intra and inter-query learning. In most Content-Based Image Retrieval (CBIR) systems based on Relevance Feedback (RF), all prior experience is lost whenever a user generates a new query, thus inter-query information is not used. In previous work, a system was developed that learns One-class ...
متن کاملA Comparative Study of Extreme Learning Machines and Support Vector Machines in Prediction of Sediment Transport in Open Channels
The limiting velocity in open channels to prevent long-term sedimentation is predicted in this paper using a powerful soft computing technique known as Extreme Learning Machines (ELM). The ELM is a single Layer Feed-forward Neural Network (SLFNN) with a high level of training speed. The dimensionless parameter of limiting velocity which is known as the densimetric Froude number (Fr) is predicte...
متن کاملOn Robustness and Regularization of Structural Support Vector Machines
Previous analysis of binary support vector machines (SVMs) has demonstrated a deep connection between robustness to perturbations over uncertainty sets and regularization of the weights. In this paper, we explore the problem of learning robust models for structured prediction problems. We first formulate the problem of learning robust structural SVMs when there are perturbations in the sample s...
متن کاملState generalization method with support vector machines in reinforcement learning
The conventional reinforcement learning assumes discrete state space. Therefore, it is necessary to make states discrete in order to handle continuous state environments. However, if a simple discretization is applied, the number of states increases exponentially with the dimension of the state space, and the learning time increases. In this paper, we propose a state generalization that is able...
متن کامل